Multi-speaker Recognition in Cocktail Party Problem

نویسندگان

Yiqian Wang

Wensheng Sun

چکیده

This paper proposes an original statistical decision theory to accomplish a multi-speaker recognition task in cocktail party problem. This theory relies on an assumption that the varied frequencies of speakers obey Gaussian distribution and the relationship of their voiceprints can be represented by Euclidean distance vectors. This paper uses Mel-Frequency Cepstral Coefficients to extract the feature of a voice in judging whether a speaker is included in a multi-speaker environment and distinguish who the speaker should be. Finally, a thirteen-dimension constellation drawing is established by mapping from Manhattan distances of speakers in order to take a thorough consideration about gross influential factors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech separation by simulating the cocktail party effect with a neural network controlled Wiener filter

A novel speech separation structure which simulates the cocktail party e ect using a modi ed iterative Wiener lter and a multi-layer perceptron neural network is presented. The neural network is used as a speaker recognition system to control the iterative Wiener lter. The neural network is a modi ed perceptron with a hidden layer using feature data extracted from LPC cepstral analysis. The pro...

متن کامل

Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments

Speech recognition in cocktail-party environments remains a significant challenge for state-of-the-art speech recognition systems, as it is extremely difficult to extract an acoustic signal of an individual speaker from a background of overlapping speech with similar frequency and temporal characteristics. We propose the use of speaker-targeted acoustic and audio-visual models for this task. We...

متن کامل

Improving Source Separation via Multi-Speaker Representations

Lately there have been novel developments in deep learning towards solving the cocktail party problem. Initial results are very promising and allow for more research in the domain. One technique that has not yet been explored in the neural network approach to this task is speaker adaptation. Intuitively, information on the speakers that we are trying to separate seems fundamentally important fo...

متن کامل

\eigenlips" for Robust Speech Recognition \eigenlips" for Robust Speech Recognition

In this study we improve the performance of a hybrid connectionist speech recognition system by incorporating visual information about the corresponding lip movements. Speciically, we investigate the beneets of adding visual features in the presence of additive noise and crosstalk (cocktail party eeect). Our study extends previous experiments by using a new visual front end, and an alternative ...

متن کامل

Auto-associative Memory: The First Step in Solving Cocktail Party Problem

One of the most interesting and challenging problems in the area of Artificial Intelligence is solving the Cocktail Party problem. This is the task of attending to one speaker among several competing speakers and being able to switch the attention from one speaker to another at any given time. Human brain is remarkably efficient in solving this problem. There have been numerous attempts to emul...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

CoRR

دوره abs/1712.01742 شماره

صفحات -

تاریخ انتشار 2017

Multi-speaker Recognition in Cocktail Party Problem

نویسندگان

چکیده

منابع مشابه

Speech separation by simulating the cocktail party effect with a neural network controlled Wiener filter

Speaker-Targeted Audio-Visual Models for Speech Recognition in Cocktail-Party Environments

Improving Source Separation via Multi-Speaker Representations

\eigenlips" for Robust Speech Recognition \eigenlips" for Robust Speech Recognition

Auto-associative Memory: The First Step in Solving Cocktail Party Problem

عنوان ژورنال:

اشتراک گذاری